What is vector database? Why is it used?
36
09-Dec-2025
Updated on 09-Dec-2025
Anubhav Kumar
09-Dec-2025What is a Vector?
A vector in AI/ML is a list of numbers (e.g., 768-dimensional, 1536-dimensional) that represents the meaning or features of data.
Examples:
[0.24, -0.11, 0.89, ...][0.72, 0.04, -0.33, ...]These vectors are generated by models like:
What is a Vector Database?
A vector database is a database optimized to:
Popular vector DBs:
Why Is a Vector Database Used?
1. Semantic Search
Traditional keyword search fails if words differ.
Vector search matches by meaning.
Example query:
Finds results like:
2. Retrieval-Augmented Generation (RAG)
When you build an AI assistant or chatbot with your own data (PDFs, docs, SQL dumps), you:
Vector DB = the memory for your AI system.
3. Recommendation Systems
Find similar:
E.g., “users who liked this phone also liked…”
All done through vector similarity.
4. Image / Audio / Video Search
You can search images by description:
Because both images and text can be turned into embeddings.
5. Anomaly Detection
Vectors can detect outliers:
Outlier = vector that is far from cluster.
6. Deduplication and Clustering
Find similar documents or merge duplicates based on semantic similarity.
How Vector Search Works (simple example)
That’s it.
In Short